SOME FURTHER COMMENTS 

The author gratefully acknowledges an anonymous corre- 
spondent who politely pointed out that the caption in Figure 2 
should be changed from "the integral. . . cannot be explicitly 
solved" to ". . . cannot be explicitly solved by idiots." The inte- 
gral in fact is solvable by non-idiots using the well known tech- 
nique of scanning the pages of Gradshteyn and Ryzhik [Table 
of Integrals, Series, and Products, Academic Press, 1965]; ref- 
erence to equation 3.386 in that text leads to the exact closed 
form solution: 



Although the basic conclusions of the paper are unaffected, a 
number of the expressions can be considerably simplifed by 
substituing this formula. This idiot apologizes to any readers 
who may be inconvenienced by the oversight. 

Also, the derivation in section 7.1 that the K 2 entropy is 
zero agrees with the numerical observation in a recent paper 
by A. Provenzale, A. R. Osborne, and R. Soj ["Convergence 
of the K2 entropy for random noises with power law spectra" , 
Physica D 47 (1991) 361-372]. 
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Abstract 

It has recently been observed that a stochastic (infinite degree of freedom) time series with 
a 1/ f a power spectrum can exhibit a finite correlation dimension, even for arbitrarily large 
data sets. [A. R. Osborne and A. Provenzale, Physica D 35, 357 (1989).] I will discuss 
the relevance of this observation to the practical estimation of dimension from a time series, 
and in particular I will argue that a good dimension algorithm need not be trapped by this 
anomalous fractal scaling. Further, I will analytically treat the case of gaussian 1 / f a noise, 
with explicit high and low frequency cutoffs, and derive the scaling of the correlation integral 
C(A, r) in various regimes of the ( A, r) plane. 
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1 Introduction 



While a number of algorithms for estimating degrees of freedom directly from a time series 
have been proposed (see Ref. [1] for a review), the correlation dimension of Grassberger 
and Procaccia [2] remains the most popular. Although these algorithms are known to be 
fraught with pitfalls [1, 3, 4, 5], it is generally considered that with a long enough time series, 
it should be possible in principle for a numerical algorithm to distinguish low-dimensional 
chaos from stochastic noise. This notion was recently challenged in a remarkable paper by 
Osborne and Provenzale [6], hereafter OP, which found that for a stochastic time series with 
a 1 / f a power spectrum, the numerical estimate of correlation dimension is a small finite 
value D = 2/ (a — 1) for 1 < a < 3, even for arbitrarily long time series. 

This observation raises at least two serious questions. One, can such an algorithm be 
fooled into incorrectly characterizing a stochastic time series as low-dimensional? Two, is 
it possible or even common for low- dimensional chaos to exhibit a 1 / f a power spectrum? 
Sections 2 and 3 will address these questions. 

In Section 4, I will discuss the anomalous scaling of OP in qualitative terms. Section 
5 will introduce the model of 1 / f a noise with high and low frequency cutoffs; section 6 
will develop some notation and useful intermediate results; and section 7, the correlation 
integral C(N, r) will be derived, and several distinct scaling regimes in the (iV, r) plane will 
be delineated. Thus, the practical limitations of real- world ("almost 1//"") noise on the 
correlation dimension estimator can be quantified. 

2 Fooling a dimension estimator 

The suggestion of OP is that numerical estimation of dimension may not be adequate to 
the task of distinguishing low-dimensional chaotic systems from stochastic processes. This 
section will address the question "Can a good dimension algorithm be fooled?" 

2.1 Stationarity 

The first requirement of a time series which is to be analyzed by a dimension algorithm is 
that it be stationary. No sensible researcher would attempt to estimate the dimension of 
(say) the Lorenz attractor from a time series that is only half a cycle long, no matter how 
many points are in the time series. Blind application of a dimension algorithm might give 
the answer that the dimension is one, and of course it is true that such a short segment of 
the trajectory is one-dimensional. But the application of common sense would reject this 
answer as irrelevant to the geometry of the strange attractor, of which the small segment is 
a poor sample. In general, it is unwise to attempt to compute the dimension of a strange 
attractor unless the length of the data set is much longer than the length of the dominant 
period. This point has also made by Mayer-Kress [7], who notes that for 1 / f a noise, the 
dominant period is always on the order of the length of the time series. The operational 
definition of stationarity is that there be no significant power at low frequencies, and this is 
violated by 1/ f a noise. 
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2.2 Autocorrelation 



An important property of 1/ f a noise is that the autocorrelation time is long. The sensitivity 
of dimension algorithms to autocorrelation is well known [1, 4, 8]. For finite autocorrelation, 
one can be assured that a dimension algorithm will not give spuriously low dimension only 
if the length of the time series is much longer (sometimes very much longer) than the au- 
tocorrelation time. If, as in the case of 1/ f a noise, the autocorrelation time scales with the 
size of the data set, then it is no longer sufficient to say that the N — > oo limit will fix the 
problem. 

In fact, a good estimator of correlation dimension will avoid all pairs of points closer 
together in time than the autocorrelation [8]. For 1 / f a noise, this excludes all the pairs; the 
output of the algorithm will not be a spurious dimension, but a plea for more data. 

2.3 The method of surrogate data 

In any case, it is recommended that the effects of linear correlation be guarded against 
with a test such as the following. Given a time series, create an ensemble (in practice, 
five or ten) of "surrogate" time series which are random except that they have the same 
variance and autocorrelation as the original time series. A practical way to do this is to take 
the Fourier transform of the original data, randomize the phases, and then take the inverse 
Fourier transform. Now, any statistic computed from the original data (dimension, Lyapunov 
exponent, etc.) should then be computed from the surrogate data as well. If the results of 
the computation are significantly different for the original data compared to the surrogate 
data (and since there are several sets of surrogate data, the statistical significance can be 
quantified), then the null hypothesis that the the original time series is linearly correlated 
noise can be rejected. 

Attempting to reject the null hypothesis is a less ambitious goal than attempting to esti- 
mate the dimension, but if the null hypothesis cannot be rejected, then there is no point esti- 
mating dimension. Applied to a time series generated by Fourier inverting a f / f a spectrum 
(this is how 1 / f a noise is generated in OP), this test will by construction fail to reject the 
null hypothesis. 

A simplified version of this approach was used by Grassberger [9] to reject a claim of 
low- dimensionality in a climatic time series. The systematic approach advocated here is also 
described in Refs. [f , 4]. 

3 The power spectrum of a chaotic attractor 

The ubiquity of 1/ f noise sources in physical systems is well established [fO, ff, 12], and 
remains a subject of much interest. Recently, a variety of nonlinear dynamical systems have 
been investigated which exhibit power spectra with 1/ f a scaling. 

Pomeau and Manneville [13] observed that intermittent behavior near a critical bifurcation 
point could exhibit a 1/ f a spectrum; this was quantified with a renormalization-group by 
Procaccia and Schuster [14]. Here, the 1 / f a behavior is only seen at non- generic critical 
points. Also in this case, the motion is bounded, which requires a < 1. 
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Keeler and Farmer [15] observed 1/ f a spectra with a near 1 in a spatio-temporal system 
of coupled maps. The "sandpile" model of Bak, Tang, and Weisenfeld [16], also exhibits 
l/f a spectra, as part of its self-organized critical behavior. In both of these cases, the 
dynamical system has many degrees of freedom. Further, there is a low frequency cutoff 
which corresponds to the finite spatial size of the system. A simple Hamiltonian model of 
a particle in an unbounded periodic potential (e.g., the motion of a classical electron in a 
crystal) was described by Geisel, et al. [17]. With only four degrees of freedom, the dynamics 
of this system exhibits a 1 / f a spectrum, with a > 1, and with no apparent cutoff at low 
frequency (though, presumably, if the crystal was of limited size, then the motion would be 
bounded, and a low frequency cutoff would be observed.) 

Osborne and Caponio [18] contrived a Hamiltonian with a power-law scaling, and the 
resulting dynamics were then shown to scale as 1 / f a over a limited range of frequencies. 

In the above examples, the 1 / f a scaling at low frequencies was emphasized. It is also the 
case that for a dynamical system that arises from a set of coupled infinitely differentiable 
ordinary differential equations, the power at high frequency must decrease faster than any 
power-law. The high frequency scaling for noisy dynamical systems is a topic of significant 
interest (see Refs. [19, 20, 21] for discussion from various points of view), but for the purposes 
of this Letter, I will argue in a subsequent section that it is the low frequency scaling that 
is important. 

While a true 1 / f a spectrum cannot be generated by chaotic motion on a bounded low- 
dimensional strange attractor, power spectra which exhibit 1/ f a scaling over a wide range 
of frequencies are quite common in practice. From this point of view, the results of OP are 
indeed relevant to the practical estimation of dimension from a time series that might arise 
from low- dimensional chaos. 

The practical solution is the same as has been described earlier for ordinary autocorrelated 
data [8]: take more data than the characteristic autocorrelation time r, and avoid pairs of 
points closer together in time than r. Still, it is interesting to see in more detail what effect 
an almost 1/ f a power spectrum has on the numerically estimated correlation dimension. 

4 Qualitative discussion in terms of crinkliness 

The concept of fractal dimension can be applied to time series in two quite distinct ways: 
one, to indicate the number of degrees of freedom in the underlying dynamical system; and 
two, to quantify the self-affinity (or "crinkliness") of the trajectory through phase space. 
Although it is the first which is of interest here, the anomalous scaling observed in OP 
can be understood in terms of the second. Indeed, Osborne and Provezale explain the low 
dimension in terms of the fractal self-affine paths through phase space that are generated by 
the 1 / f a spectrum. While crinkliness is (at least intuitively) a high frequency phenomenon, 
I will argue that it is really the the low frequency or long time correlation that is important. 

Fractional Brownian motion 

An early association of fractal dimension with power- law spectra is cited by Mandelbrot and 
Van Ness [22] in their discussion of fractional Brownian motion. Here, an exponent H , with 
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< H < 1, describes the power-law "diffusion" rate: 




for small times T, while smooth motion leads to H = 1 for small T. A long term linear drift 
gives H = 1 for large T, and bounded motion implies H = in the large T limit. Fractional 
Brownian motion exhibits a 1/ f a power spectrum with a = 1 + 2i7. 

Graph dimension 

One way to quantify crinkliness is in terms of the graph dimension, which is defined as 
the fractal dimension of the set of points (t,x t ) G R 2 . The graph dimension for fractional 
Brownian motion is given by D g = 2 — H [23]. For 1 / f a noise, since H = (a — l)/2, we have 
D g = (5 — ct)/2 for 1 < a < 3. Note that D g saturates at one (that is, the graph loses its 




crinkliness) for a > 3. 
Dimension of a local segment 

Alternatively, one can define a fractal dimension for a segment of the trajectory itself. Let 
x^\ . . . , x[ m ^ denote m components of a multidimensional time series, and consider the 
trajectory of these m components in R" 1 . Over a small interval of time T, the distance 
between endpoints of the segments is equal to r = max; \xf* — x^}_ T \. The local dimension D 
is given by the scaling of "bulk" (number of points, T) with "size" (r); that is, T ~ r D . For 
fractional Brownian motion, ((x t — a^.y) 2 ) 1 / 2 ~ T H implies a dimension D = 1/H. Thus, 
for 1/ f a noise, the dimension is given by D = 2/ (a — 1) for 1 < a < 3, and D = 1 for a > 3. 

Recurrence time 

The dimension for a local segment of a fractal path is the same as the dimension of the full 
trajectory (in the N — > oo limit) only if the trajectory is non-recurrent. For a recurrent 
trajectory, the total number of points (the "bulk") within r of a fiducial point will be much 
greater than the T above, because a recurrent trajectory keeps coming back to the fiducial 
point. 

More crucial than the high frequency crinkles, then, is the relationship between the low 
frequency cutoff and the length of time series; and whether or not the trajectory through 
phase space is recurrent. If there is an explicit low frequency cutoff and an arbitrarily long 
time series, then eventually the trajectory will arbitrarily many times come back arbitrarily 
close to a fiducial point on the trajectory. On the other hand, if the length of the time series is 
limited, and the cutoff f Q is very small, then the trajectory may be effectively non- recurrent. 

For the non-recurrent case, the dimension of the full trajectory will be equal to the 
dimension of a local segment. But if the time series is long enough to be recurrent, the 
phase space will be "filled out" and the dimension estimated will be that of the embedding 
space. One property of 1/ f a noise (without cutoffs) is that it is not recurrent for m > 
2/(a — 1) where m is the embedding dimension. (As an example, an ordinary random walk 
exhibits a 1/f spectrum, and is known to be recurrent only in one and two dimensions. 
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In three dimensions, there is a finite probability that the trajectory will never return to a 
neighborhood of the starting point.) From this point of view, it is not surprising that the 
dimension of "pure" 1 / f a noise should saturate at D = 2/(a — 1) for large m. 

Finite dimension in non-power-law spectra 

Inspired by the results of OP, some researchers have looked for finite fractal dimension in 
time series with spectra that are not strictly power-law. For instance, Greis and Green- 
side [24] observe fractal scaling in a class of stochastic Langevin equations. This class in- 
cludes Ornstein-Uhlenbeck noise, the power spectrum of which is approximately constant for 
low frequency, and 1/ f 2 for high frequency. For small N, an apparent dimension of D = 2 
is observed; for larger N however, the dimension approaches the embedding dimension. It 
is important to distinguish the scaling these authors sometimes seek (which is a measure 
of crinkliness) from what correlation dimension attempts to quantify (which is number of 
degrees of freedom). In fact, correlation dimension is probably not a very good measure of 
crinkliness, compared to the graph dimension (for which recurrence is explicitly prohibited) 
or a direct measure of the diffusion exponent H. See Refs. [25, 26, 27] for discussion of 
numerical issues in this context. 

5 Explicit and implicit frequency cutoffs 

It is important to note that a pure 1 / f a power spectrum is physically impossible. Depending 
on whether a is greater or less than one, there will be an infrared or ultraviolet catastrophe 
- that is, the total power will diverge to infinity. There is therefore a need to introduce 
frequency cutoffs in order to analyze the properties of 1 / f a noise. 

Even without explicit frequency cutoffs, there are implicit cutoffs for real time series. 
The low frequency is effectively cut off by the finite length of the time series, and the high 
frequency is cut off by the sampling rate. 

In what follows a particular stochastic time series will be discussed. A 1/ f a spectral 
shape will be considered, with a cutoff at low freqency f Q and at high frequency f 1 . (I 
assume f\ ^> f .) In the absence of explicit cutoffs, I will take f Q ~ 1/-/V, and f\ ~ 1. From 
this model, the correlation integral C(N, r) will be derived analytically for various regimes 
of N and r, as a function of the cutoffs f Q and / 1? the exponent a, and the embedding 
dimension m (which, in general, will be taken to be large). 

The particular power spectrum that will be considered here exhibits 1/ f a scaling between 
a lower and upper cutoff frequency. Below the lower cutoff, the spectrum is flat, and above 
the upper cutoff, the spectrum is zero. 




for f<f 
for f <f<h 
for fi < f 




These cutoffs ensure that the time series has a well defined variance: 
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For a < 1, the high frequency cutoff is necessary to prevent an ultraviolet catastrophe, 
and for a > f, the low cutoff is necessary to prevent divergence at low frequencies. Since 
anomalous estimated dimension occurs for a > f, that is the case that will be considered 
here. Then, we can write 

*L == <* 2 > « ^T/o 1 -"- (4) 
a — 1 

The autocorrelation function can be expressed as a Fourier transform of the power spec- 
trum: 

oo 



A(T) := 4~ I co S (27rfT)S(f) df 



2 
rms 



2 

rms 



f° cos(27r/T)/ - a df + f 1 cos(27r/T)/-« df 

Jo Jf a 



(5) 



From this, the variance of the differenced time series can also be calculated; here 

4 := ((x t -x t _ T ) 2 ) = 2x 2 ms -2(x t x t _ T ) = 2x 2 ms (l-A(Tj) (6) 
4sm 2 (7rfT)S(f)df. (7) 

The behavior of this statistic at various ranges of T will prove to be important for subsequent 
analysis, and so this will be derived in some detail in the next section. 



6 Behavior of the statistic u\ := ((x t — xt-r) 2 ) 

In this section, I will calculate the statistic a\ for the example of truncated l/f a noise 
described above. This approach is just the reverse of Mandelbrot and Van Ness [22], who 
define fractional Brownian motion by requiring a priori that cry ~ T H for some exponent H. 
From Eq. (7), we can write 

4 = / /0 4sin 2 (7r/T)/ - a df + f h 4sin 2 (7r/T)/- a df 

Jo Jf a 
= Vo 1 - 

The asymptotic behavior of this integral expression for a\ can be evaluated in each of 
three regimes. We find cry ~ T for very short time T, cry ~ T^" -1 ^ 2 for T in an intermediate 
regime, and cry ~ constant for very large T. Fig. I shows how cry varies with T in each of 
these three regimes. 



sin(27r/ T) 
2vr/ T 



+ / l>in'iT// !/ ' df. 



(8) 



Figure 1: Variation with T of the statistic <j|> = ((x t — Xt-r)' 2 ) 
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6.1 I: Very short time, T <C 

Here, sin irfT ~ 7r/T, and 1 — sin(27r f T) / (2irf T) ~ |(27r/ T) 2 , so Eq. (8) can be approxi- 
mated 



2 _ 



2/, 



L6 

(2ttT) 2 (/, ; 



+ 4tt 2 T 2 / 2 " a 



(/i 3 - - ^fo 3 ~ a ) /(3 - a) 



|2_ r 3- aT 2 for a 3 
47r 2 a r 3 — arri2 



3(3-a) 



//- a T 2 for a > 3. 



(9) 



Thus, cry ~ T for small T, which implies no crinkles; this arises as a direct consequence 
of the high frequency cutoff. 

6.2 II: Intermediate time, f//i < T < 1 / f Q 
Now, let u = fT, so 



On 



where the multiplier 



-f 3 - a (27rT) 2 + AT 
1(a) : = 



a-l 



AT 



foT 



sin 2 (7m) u a du 



sm ttu u 



~ a du 



(10) 



(1 



converges to a constant for f < a < 3, and is of order unity as long as a is not too close to 
f or to 3. (Asymptotically, 1(a) ~ 2(0-1) for a — > 1 ; and 1(a) ~ for a — > 3.) 



Figure 2: Integral 1(a) = J Q °° sin 2 (7rw) 



du 



The scaling in this intermediate regime corresponds to the scaling postulated for fractional 
Brownian motion, with H = (a — f)/2. 

For a > 3, the integral in Eq. (ff) diverges, and it is not justified to approximate the 
lower limit f Q T as zero. In this case, we obtain that the leading order is equal to the small 
time limit given in Eq. (9). In this case, there is no distinction between regimes I and II. 
Intuitively, a > 3 corresponds to not having enough high frequency to "crinkle" the path. 



6.3 III: Very long time, l/f < T 
In this case, let u = f / f , s ° 

sin(27r/ T) 



On 



1-a 



1 



2/ 



2vr/ T 
sin(27r/ T) 
27t/ T 



l-a f^ 1 ^ . 2 

+ 4/ a J sin (irf Tu) u a du 



+ 2/ 



[f — cos(27r/ Tu)] u a du 
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2/ Q 



Vo 



sin(27r/ T) 



+ 2/ 



a 



a - 1 



a- 



os(27r/ r) 

(2vr/ T)2 



But, a; 2 



fc, 1 " I \ a ~ 1), so we can write 



rr 2 ~ 

(J 71 



2x 



2 

rms 



sin(27r/ T) cos(2irf T) 
H : — — a- 



(a - 1) 2irf T 



(2vr/ T) 5 



COs(27T/ o r) 

(2vr/ T) 2 



(12) 



(13) 



6.4 Crossover times 

The crossover times between these behaviors occurs roughly at T ~ l//i and T\ ~ l/f - 
However, it is possible to more precisely specify these times by equating the formulas for o\ 
in the different regimes. 

For the first crossover time, assume 1 < a < 3 (otherwise, there is no crossover behavior), 
and write (27rT) 2 / 1 3_a /(3 — a) = 4/(a)T a_1 , which, upon solving for T, gives the crossover 
time 

1/(3-*) 



T 



(3-a)I(a) 



7T Z 



For the second crossover time, write 4/(a)T" 1 = 2f Q x a / \a — 1) to obtain 

T 1 = [2( a -l)I( a )/ a ] 1 ^ 1 - a \l/f ) 



(15) 



These coefficents of the l//i and l/f Q are "only" numerical factors, but they are still 
useful for more precisely pinning down the scaling range. 2 

In particular, note that the range for intermediate (fractal) scaling of cry with T is given 

by 

^ = A»f (16) 

-L o Jo 



where the numerical factor is given by 



1/(1-*) 



K(a) 



[2(a - 1)1 (a) /a] 
[(3-a)/(a)/7r 2 ] 1/(3 - a) 



(17) 



For a near 1 or 3, this factor can be extremely small, which implies that the range over 
which anomalous fractal scaling is observed can also be very small. This also partly explains 
the difficulty that both Fox [26] and Higuchi [27] describe in getting numerical agreement 
with theory for a near 1 and 3. 



2 For example, there is an unexplained factor of approximately ^ that appears in Figure 3 of Higuchi's paper [25]. His graph 
dimension of 1.21 corresponds to a = 2.58, which gives f a % 0.15/T o , which is consistent with the figure. 
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6.4.1 Implicit frequency cutoffs 

In the absence of explicit cutoffs, the effective frequency cutoffs are defined by crossover 
times of T = 1 and T\ = N. From Eqs. (f4, f5), this gives 

fi = [(3-«)/(«)/7r 2 ] 1/(3 " a) (18) 
f a = [2(a - l)I(a)/a] 1/{1 ~ a) /N (19) 

6.5 Characteristic length scales 

Associated with the characteristic times T = 1, T = T Q , and T = T t are characteristic 
distance scales. Define 

( 2 7 r,/^/ 1 ( 3 - a )/ 2 for«<3 
X^^l^fo^' 2 for a > 3. 

and 

i? x = V2v To = [Mia)] 1 ' 2 ^-^' 2 . (21) 
In between these two distances a third will be defined 

R* = [8/(a)] 1/2 . (22) 

Note that in the case T Q = 1 (no explicit high frequency cutoff), we have = R t = R Q . 
Finally, the largest distance scale is a; rms which is also equal to v^CTi • An expression for this 
appears in Eq. (4). 

7 Derivation of the correlation integral 

Using the results and definitions of the previous section, I will derive an analytical expression 
for the expected value of the correlation integral C(N,r) in the case of l/f a noise with 
cutoffs, and then show how the expression scales in various regimes of N and r. 

Consider Py(r) as the probability that a pair of points (x t _T,x t ) satisfies (x t — a; t _y) 2 < r 2 . 
For the gaussian noise considered here, Px( r ) = er f(yf — )■ ^ ^ e trajectory is embedded in 
m dimensions, this becomes Pt(t) = er f(y| — )"\ assuming the or "max" norm. This 
exact form relies on the assumption that each component of the embedded time series is 
independent. In OP, this is ensured by taking separate realizations of 1/ f a noise for each 
coordinate. Although a delay-time embedding does not strictly satisfy this criterion, it has 
been observed numerically that the same qualitative results are obtained [6, 8]. 

Now the correlation integral is given by C(A, r) = (2/A 2 ) J2t(N — T)Py(r). Thus, 

2 N 



C(N,r) = — E(iV-T)erf(^=-r (23) 
is an exact expression for C(A, r) in terms of the statistic cry. 
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It is important to recognize that the function erf has as asymptotic behavior, erf(a;) ~ x 
for small x, and erf(a;) ~ 1 for large x. Thus the sum can be broken into two components, 
one for which r < \[2g^ an d ° ne f° r which r > \[2g^. Let T(r) define where this transition 
occurs: 

V2<Tx(r) = r. 



Then, 



C(N, 



2 

iV 2 

2 

iV 2 
2 

iV 2 

2T(j 



T(r) 



N 



J2(N - T)erf(-^r + £ (N - T)eri(-=—\ 

r=l V Z(Jt T=T(r) V 



T: 

'T(r) 



N 



T=l 



T=T(r) 



V2< 



Of 



N 



N 



(NT(r) - -T{rf) + r m 2~ m/2 ^ (N - T)a^ 

^ T=T(r) 

N N 

E N °T m - E T °T 

T=T(r) T=T(r) 



T(r) 
2N 



cyl-m/2 
l r 

N 2 



(24) 



(25) 



This expression, though a bit unwieldy as it stands, will be shown to exhibit a variety of 
different scalings in different regimes, beginning with the fractal scaling observed in OP. 



7.1 Regime of fractal scaling 

Assume that r is sufficiently large, and N is sufficiently small that the scaling cry ~ 
holds for all T(r) < T < N. This requires a < 3, and N < T\ ~ an d 2^( r ) ^ T Q ~ 
or r > \/2ot o = This argument will also require m > 4/(a — f). Now, 



N 



J2 Na^ m « 

T=T(r) 



AT 
T(r) 

iV[4/(a)]- m / 2 
f - m(a - f)/2 

iV[4/(a)]- m / 2 
f - m(a - f)/2 



N 



iV[4/(a)]- m/2 / rp-m(a-l)/2 
JT(r) 



T(r) 1_m ^~ 1 ^ 2 — iV 1- " 1 ^ -1 )/ 2 



\l-m(a-l)/2 



i(a-l)/2-l' 



iV 



The second term is negligible if iV ^> ^( r ) an d m > 2/ (a — 1 ), and in this case 



N 



Similarly, 



E N °T 

T=T(r) 

N 

E T °T 

T=T(r) 



N[AI(a)] 



-m/2 



1 - m(a - f)/2 

[47(a)]"" 1 / 2 

2 - m(a - f)/2 



T(r 



T(r 



,l-m(a-l)/2 



,2-m(a-l)/2 



(26) 



(27) 



(28) 
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but this time, the condition for neglecting the extra term is that m > 4/(a — 1). Also, note 



that since A ^> T(r), we can neglect the YIt 



I ' —7 

T(r) 1 (J T 



term because it is much less than 



Nan 



&=T(r) 1,U T 

For convenience, write 

D := 2/(a - 1) 

Now, use a T(r) « [4/(a)] 1 / 2 T(r)( a - 1 )/ 2 (from Eq. (10 
Eq. (24) which defines T(r), and Eq. (22 

D 



w 



which defines i?*, to obtain 



hich is valid in this regime) 



T(r) 



r 

R* 



(29) 
with 

(30) 



Then from Eq. (25), we have 
C(N,r) 



2 

A 
2 

A 



d 2 1_m / 2 
H — — r' 



m 



2D 



m 



D 



A 2 
r 

R* 



N(R m /y/2)- 
1-m/D 



-T(rf- 



i/D 



D 



(31) 



This demonstrates the C(A, r) ~ r D scaling that was observed in OP. Apart from pro- 
viding an alternative derivation of the scaling that is applicable directly to the correlation 
integral, it gives a sense of how observable the phenomena might be. For instance, it shows 
that limjv^oo C(N, r) = 0. For sufficently large A, as will be shown in the next section, the 
contribution of the fractal term becomes negligible, and C(A, r) ~ r' m scaling will dominate. 

The m dependence of the expression above for C(A, r) is also interesting. (First of all, 
recall that the expression was derived under the assumption m > 2D and so is valid only for 
large m.) Grassberger and Procaccia [28] define an entropy K 2 which expresses the scaling 
of the correlation integral with m. 



C m (N,r) 



~ e 



(32) 



This kind of exponential dependence on m is not seen in Eq. (31). In contradiction to what is 
claimed in Ref. [6], I find that the correlation entropy does not have a "finite and predictable 
value", but instead gives K 2 = in the limit m — > oo. 



7.2 Very large A 

For A sufficiently large, I can show that C( A, r) will not exhibit the fractal scaling observed 
in OP; but instead will scale as C(A, r) ~ r' m indicating a stochastic system. First, break 
Eq. (23) into two terms: 



C(A, 



2 



T=l 



N 



T)erf( 



a/2, 



(33) 



The first term was calculated in the previous section on fractal scaling. To calculate the 
second term, note that cry ~ \/2a; rms for all T > 2~i, so for r < 2a; rms , we have 

D 



C(N, 



2 
A 



m 



m 



2D 
^D~ 



r 



+ (1 - 2TJN) 



(34) 
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This sum will be dominated by the second term as long as 

This implies that the number of needed data points N increases for smaller r . If this condition 
holds for all r > i?*, then 

flf>2 (^)% 2 (_i_)- /, / ^-.,/« (36) 

in which case 



C(^r)=i— (37) 

Note that in this case, unlike the case of the fractal scaling regime, the limit liimy^oo C(N^ r) 
is nonzero. It is this scaling that ultimately dominates, although as Eq. (36) shows, N may 
have to be extremely large before this happens. 

As this argument shows, it is the low frequency cutoff f Q that is crucial. Another way to 
think of this is in terms of the average recurrence time T(e) for a trajectory to come back 
to within e [7]. In this case, 

9 m \rv I ( rv — 1 W' 171 ! 2 

T(e) = lim 1/C(N, e) = (2x ims / ' e) m = _UL_^J_ /o ™U--)/2 (38) 

If N ^> T(e), then good scaling is expected for C(N,r) for r > e. Note that for large 
embedding dimension m, the data requirement is extreme; so that even to say A^ ^> 1/ f does 
not guarantee that spurious scaling will not be observed. In the case of Ornstein-Uhlenbeck 
noise, a = 2 and r = 1/ f , so that this condition becomes A^ ^> r" 1 / 2 , in agreement with one 
of the main results of Ref. [8]. 

7.3 Effect of high frequency cutoff fi 

While a low frequency cutoff is essential to prevent an infrared catastrophe, the high fre- 
quency cutoff is optional for a > 1. When there is a high frequency cutoff, the trajectory 
loses its crinkly shape at small distances, and locally looks one dimensional. This can be 
reflected in the correlation integral as a scaling of C(N, r) ~ r over a range of r. 

In the range T < T Q , we have cry = cr T=1 T and Eq. (9) gives an expression for cr T=1 . 
Combining this with Eq. (20) for R D and Eq. (25), we can write 

<Wr) = (39) 
This scaling is exhibited in the range R D < r < R t . 

7.4 a > 3 

If a > 3, there is not enough high frequency to crinkle the trajectory. The smooth trajectory 
is locally one-dimensional, and a scaling C(N, r) ~ r is observed. In particular, 
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valid for the range R < r < x r 



7.5 Very small r 

For r sufficiently small that r < \[2a^ f° r ah T (that is, for r < R Q = ^/2cx=i), it is possible 
to approximate erf(r/-\/2crx) by r/v^cx- I n this case, Eq. (25) becomes 



C{N, 



2l-m/2 



iV 2 



AT 



AT 



,T=1 



T=l 



2 

iV 



N 



52(l-T/N)(<rr/a T=1 ) 



T=l 



/ r 



(41) 



The term in the square brackets is a constant, and in the limit m — > oo approaches unity, so 

2 / r 
* N \R n 



C(N, 



(42) 



In general, regardless of the cutoffs, we find C(N, r) ~ r' m for small r; in particular, the "lower 
half" of the curve, where C(N, r) < 2/N, will exhibit r' m scaling indicating a stochastic time 
series. Since C(N,r) > 2/N 2 , the range in r over which this scaling is valid is i? A^ _1 / m < 
r < R n . 



8 Conclusions 

The conclusions can be presented as a "phase diagram" in the (iV, r) plane. As Fig. 3 shows, 
as many as three different scaling exponents can be seen. For sufficiently large N , only the 
correct C ~ r' m scaling is observed, though N may have to be extremely large for this regime 
to be achieved. The case considered in OP involves no explicit frequency cutoffs, and can 
be recovered with 1 / f ~ N and fi ~ 1. In this case, anomalous scaling of C ~ r 2 ^ a_1 ^ 
is observed for large r and correct scaling of C ~ r' m is observed for small r. If the high 
frequency cutoff is significant (/ x < 1), then a third regime is seen, in which C ~ r scaling 
is observed. 



Figure 3: Phase diagram of scaling in (N,r) plane 

Fig. 4 shows a representative correlation integral, computed from a time series of f / f a noise 
with high and low frequency cutoffs. Notice that three different scaling regimes are visible. 

Figure 4: Correlation integral C(N,r) for l/f a noise. 

I should emphasize that all these calculations were done using the original definition of 
correlation integral, specified by Grassberger and Procaccia [2]. These anomalous regions are 
further restricted and can even be eliminated if the modified correlation integral defined in 
Ref. [8] is used. For practical estimation of correlation dimension, it is this modified version 
that is preferred. 
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Figures 



Fig. 1. Variation of with T. 

This figure shows the the three regimes (I, II, and III) of T as defined by the behavior of 
the statistic cr 2 ^ = ((x t — a; t _y) 2 ). Shown are the crossover times T Q and T x . Also shown are 
the distance scales i?*, and a; rms . Note that the axes are logarithmic. In regime I, 
Uj, ~ J 1 ; in regime II, <7y ~ 

j'("-i)/2. and 

in regime III, cry approaches a constant. 
Fig. 2. The integral 1(a) = Jo°° sin 2 (7ru) u~ a du. 

A number of results derived in this Letter are expressed in terms of this integral, which 
cannot be explicitly solved. Although 1(a) is often treated as being of order unity, note that 
it can be quite large for a near 1 or 3. 

Fig. 3. Phase diagram of scaling in ( A, r) plane. 

This figure shows how C( A, r) scales with r in various regimes of the (A, r) plane for 
l/f a noise with cutoffs as defined in Eq. (2). This figure assumes 1 < a < 3 and that 
the embedding dimension m is large. The axes are logarithmic. 

Since the input is inherently stochastic, the correct scaling is C ~ r m . This scaling is 
observed in areas (a), (b), and (c). In (a), A is sufficiently large (see Eq. (36)) that correct 
scaling is seen for all r. For small r < in (b), correct scaling is observed for all values of 
A. In the third regime, (c), with both r and A large (according to Eq. (35)). 

If there is a high frequency cutoff, _/i <C 1, then there will be an intermediate regime 
(d) for which C ~ r scaling is observed. Note that the width of this regime is given by 
R\l R ~ I/A? an d that if there is no explicit high frequency cutoff, then f 1 ~ 1, and this 
regime vanishes. 

If r > a; rms , in area (e), then C( A, r) will have saturated to unity, and the trivial scaling 
C ~ r° will be observed. Further, there will be no scaling in area (f), where r so small that 
C(N 7 r) < 2/A 2 , since there are no distances that small. 

Finally, the regime (g) of large r and small A exhibits the anomalous fractal scaling 
C ~ r 2 /( a_1 ) that was observed in OP. If there is no explicit low frequency cutoff (the case 
considered in OP), then the effective cutoff is given by f = 1 /A, and this anomalous scaling 
will be observed even for arbitrarily large A. 

Fig. 4. Correlation integral for 1 / f a noise. 

The correlation integral was computed for embedding dimension m = 1,2,..., 15, using 
A = 16384 points in a time series of 1/ f 2 noise with high frequency cutoff fi = 0.05 and low 
frequency cutoff f = 0.00006 (equivalent to no explicit low frequency cutoff). One observes 
C ~ r' m for small r; C ~ r for a very small range of intermediate values of r; and the 
anomalous scaling C ~ r 2 ^ a_1 ^ = r 2 for large r. 
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